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Abstract: Searches for a heavy Standard Model Higgs boson focus on the 'gold plated 
mode' where the Higgs decays to two leptonic Z bosons. This channel provides a clean 
signature, in spite of the small leptonic branching ratios. We show that using fat jets the 
semi-leptonic ZZ mode significantly increases the number of signal events with a similar 
statistical significance as the leptonic mode. 
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1. Introduction 

The main task of the LHC is to understand electroweak symmetry breaking, e.g. by 
confirming or modifying the minimal Higgs mechanism of the Standard Model jl], [| . The 
cleanest Higgs signatures arise from Higgs decays to gauge bosons, where the Z || f|, ||, |6| 
or W j|, H|, ^] bosons decay leptonically. Seeing purely hadronic Higgs decays at the LHC is 
an attractive goal [10, [ll|, [l^, p^j , and recent developments in searches for boosted H —¥ bb 



decays are putting us into a promising position [la , O ] . Mixed leptonic and hadronic 
decay products of the Higgs boson appear for example in searches for H —> tt in weak 



boson fusion (WBF) and have a similar reach as the purely leptonic mode jlj], IS, poj . 
Most of the papers listed above focus on a low-mass Higgs boson, but the same question 
we can of course ask for any Higgs mass. 

The reason for the overwhelming interest in a light Higgs boson is that global fits 
to electroweak precision measurements pTl indicate that in the Standard Model the Higgs 



mass ran has to lie below 144 GeV at 95% CL. Direct searches at LEP exclude masses below 



114.5 GeV [22], and CDF and DO report an exclusion of the 163-166 GeV mass window 



after collecting 2.1-5.4 fb _1 of data [23|. In spite of this focus on light Higgs bosons we 
need to keep in mind that all of these measurements include theory assumptions, basically 
that there be no weak-scale modification of our Standard Model with its minimal Higgs 
sector. 

Once we allow for such modifications the Higgs boson might for example become 
heavier. While the lower bounds on the Higgs mass are set by experiment, upper bounds 
arise from theoretical considerations, including the tree-level unitarity requirements [24] 



and the triviality bound pq] . Requiring that the Higgs self coupling remains finite and 
conservatively assuming that the cut-off scale to where the Standard Model remains valid 
extends only to the Higgs boson mass itself, calculations on the lattice give an upper bound 
of ran < 640 GeV pq ]. Somewhere above this mass range the Higgs width will increase 
to a significant fraction of the Higgs mass, so we would not consider such a Higgs state 
fundamental. Such a broad Higgs resonance will generically become hard to observe as a 
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well defined mass peak over backgrounds. In this paper we will focus on the mass range 
300 GeV <m H < 600 GeV. 

At the moment the experiments at the LHC are running and collecting data, but 
collisions at the 14 TeV design energy will not be possible within the next two or three 
years. The significant increase of the center-of-mass energy at the LHC compared to the 
Tevatron, before and after the upgrade to 14 TeV, will extend the Higgs boson exclusion 
and discovery reaches very rapidly. This is particularly obvious for the relatively easy 



intermediate mass range uih > 140 GeV. The gluon- fusion channel 27] yields the biggest 
production cross section for a Standard Model Higgs boson at the LHC. Once the branching 
ratio to Z pairs becomes sizeable the so-called 'gold plated mode' H — > ZZ* — > Al results in 
a very clean final state and allows for a Higgs boson discovery up to run = 600 GeV based 
on 10 fb _1 at yfs = 14 TeV ||. In this mass region roughly 30% of the Higgs bosons decay to 
Z bosons |2q| . The charged lepton mode can be complemented by H — > ZZ* — > l + l^uv ||. 
Unfortunately, the fact that only 6% of the Z bosons decay to electrons or muons means 
that this gold plated mode is strongly statistics limited. Allowing for one of the Z bosons to 
decay hadronically and hence including the 60% hadronic Z decays increase the expected 
number of events. Note that 15% of Z bosons decay to bb pairs, but searching for this 
channel by imposing one or two additional 6-tags contradicts our primary goal of increasing 
the number of signal events compared to the purely leptonic sample. 

The semi-leptonic channel 

pp^H^ZZ^(££)(jj) (1.1) 

has not been given the attention it deserves in the context of heavy Higgs searches. This 
can be partly understood because it is very difficult to compete with the clean leptonic final 
state, and additional backgrounds like Z + jets make the extraction of the semi-leptonic 



signal events a difficult task. We argue that recent developments of subjet techniques ]14 



29, 30, |3]J changes this assumption. If a heavy resonance (H) decays to intermediately 
resonances (Z) which subsequently decay to quarks, the final-state quarks will be highly 
collimated. Thus, the hadronic Z decays can be collected in a 'fat jet'. It has been shown 
for gauge bosons [34], Higgs bosons [14, 15, 16, 17, ^2|, 33], and top quarks [|35], [$(], 17] 
that we can achieve a successful QCD background rejection based on kinematic patterns 
of subjets inside the fat jet. The experimental signature in Eq. ([O]) requires us to first 
reconstruct the boosted Z boson using subjet techniques and then combine the hadronic 
Z boson with a leptonic Z decay to form a Higgs resonance. 

2. The gold plated mode 

To be able to compare our semi-leptonic ZZ chanel with the purely leptonic mode over 



the entire Higgs mass range we reproduce the results for the four-muon final state of [36] 
and find good agreement, see Figure |]. 

Throughout this paper we normalize the total rate for the gluon-fusion Higgs signal 
and the backgrounds to the next-to-leading order predictions. The NLO signal cross section 
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we obtain by scaling the LO value from PYTHIA 6.4 [37] with a factor, K = ctnlo/olo 
from HIGLU p8j. The transverse momentum distribution of the Higgs boson simulated 
with PYTHIA 6.4 approximates the full calculation with POWHEG very well plOfl . Weak 
boson fusion we include in the inclusive signal. The NLO corrections to this production 
process are known to be small pQ|] , so within errors we assume K=1.0. The dominating 
background for the four-muon signature is continuum ZZ production. We simulate this 
background using MadEvent [41] and PYTHIA 6.4. Its NLO cross section comes from 



MCFM |H, giving us 7.39 pb at 7 TeV and 19.02 pb at 14 TeV. 



To select a muon we demand it to be central and sufficiently hard 

M < 2.5, PT, M > 7 GeV for |y M | < 1.1 

p Ttfl > 13 GeV for \y^\ > 1.1. (2.1) 

The muons have to be isolated, that is the hadronic transverse energy in a cone of R = 0.3 
around the lepton has to be ^Thadronic < 0-1 ^T,ti- We accept events with at least four 
isolated muons passing the staggered p^ cuts 

Pt,h > 15, 15, 12, 8 GeV . (2.2) 

The Z bosons we reconstruct combining two oppositely charged isolated muons, requiring 

m z - 10 GeV < m MM < m z + 10 GeV. (2.3) 



For this analysis we consider five different Higgs-boson masses. Because the Higgs 



width grows very fast with the Higgs mass [28| we widen the mass windows for a recon- 
struction according to 



(300 ± 30, 350 ± 50, 400 ± 50, 500 ± 70, 600 ± 100) GeV. 



(2.4) 



The mass windows are completely dominated by the physical Higgs width. Detector effects 
like the lepton or jet energy scale will have only little effect, which means we keep the 
window for reconstructed Higgs mass for the leptonic and the semi-leptonic analyses. 





7 TeV 


14 TeV 


m H [GeV] 


*S [fb] 


<tb [fb] 


S/B 


s/Vb 10 


°S [fb] 


ob [fb] 


S/B 


s/Vb iq 


300 


0.35 


0.42 


0.8 


1.7 


1.39 


0.56 


2.5 


5.9 


350 


0.35 


0.38 


0.9 


1.8 


1.52 


0.53 


2.9 


6.6 


400 


0.28 


0.21 


1.3 


1.9 


1.34 


0.31 


4.4 


7.6 


500 


0.11 


0.11 


1.0 


1.1 


0.65 


0.18 


3.7 


4.9 


600 


0.05 


0.07 


0.7 


0.6 


0.30 


0.12 


2.5 


2.7 



Table 1: Signal and background cross sections for the purely leptonic H — > ZZ analysis. The final 
significance we compute for 10 fb -1 . 
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Figure 1: Invariant mass distribution of the reconstructed Z bosons in the semi-leptonic channel 
at 7 TeV (left) and 14 TeV (right) collider energy. 



The purely leptonic channel is very clean and with four relatively hard muons not 
plagued by large background rates or large background uncertainties. Systematic errors 
should not be a problem since for a/s = 14 TeV we find an outstanding signal-to-background 
ratio of S/B > 1 over the whole mass region. The results for collider energies of 7 TeV 
and 14 TeV we list in Table g. The main distinguishing feature of signal and background 
is the four-muon invariant mass which we show in Figure g. Its signal shape is clearly 
distinguishable from the background, which makes this channel a save bet for a data 
driven side-bin analysis. While the significances shown for 7 TeV running will hardly give 
us an evidence for a heavy Higgs, at an energy of 14 TeV a discovery based on a modest 
integrated luminosity should not be a problem. 



3. The semi-leptonic channel 

For the semi-leptonic signature pp — > H — > jj££ we need to consider Z + jets, ZZ, tt 
and WZ backgrounds. For the main background Z + jets the NLO rate after requiring 



PTjet > 100 GeV is 33.91 (9.94) pb for a collider energy of 14 (7) TeV @. The NLO 



normalization of the tt rate is 875 (157.50) pb 43] while for WZ production we find 43.44 
(17.31) f5| . The ZZ background corresponds to the numbers quoted in Section 

If a heavy Higgs boson decays to two Z bosons the Higgs mass generates sizeable 
kinetic energy for the Z bosons. In Figure^ we see that 70% of the leading Z bosons have 
Pt > 150 GeV for tuh = 400 GeV. The geometric distance between the Z decay jets is 
roughly ARj lt j 2 ~ 2mz/pT, which means that the inclusive Cambridge- Aachen (C-A) jet 
algorithm [44] with R = 1.2 should be able to collect all Z decay products in a fat jet. 



Our analysis is based on grouping all final state particles after showering and hadroniza- 
tion into detector cells of size Ar] x A(j) = 0.1 x 0.1. This simulates the finite resolution 
and thresholds of a calorimeter. We then combine all particles in a cell and re-scale their 
total three- momentum such that each cell has zero invariant mass. Only cells with energy 
above 0.5 GeV we cluster into jets. The results of our analysis we show for the following 
individual steps 
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Figure 2: pr distribution of the leading Z boson for different Higgs masses mjj, assuming y/s 
I 1 TeV. 



Fat jet — Our fat jet requirement on this calorimeter simulation uses the C-A algorithm 
implemented in Fast Jet [^] with R = 1.2. For this jet we require \yj\ < 2 and 
p Tj > 150 GeV. 

Leptonic Z reconstruction — As part of our selection cuts we ask for exactly two iso- 
lated muons with pt > 15 GeV and |r/| < 2.5. Their invariant mass has to match 
m z ± 10 GeV. 



Hadronic Z reconstruction — To reconstruct the hadronic Z we follow Ref. [14]. For 
the hardest jet in the event we undo the last stage of clustering. The two result- 
ing subjets in the splitting j — > j\j2 are labeled such that nij 1 > rrij 2 . If there 
is a significant mass drop, rrij 1 < frnij, and the splitting is not too asymmetric, 
y = Ai?| x j 2 min(j>y j 1 :Pr j 2 ) > 2/cutW/> the jet j is expected to be the resonance's 
neighborhood and the declustering stops, otherwise redefine j to be equal to j±. This 
process continues until the mass drop condition is met. If this does not happen the 
event is removed. We choose fi = 0.67 and y cut = 0.09. Varying = 0.33 — 0.67 does 
not improve S/y/B. After the mass drop condition is met we filter the fat jet |l4|]: the 
constituents of the two subjets which survive the mass drop condition are recombined 
with the higher resolution Rfm = min(0.3, ARj 1 j 2 /2) and the three hardest filtered 
subjets are required to give m r £ c = mz ± 10 GeV. 

Higgs reconstruction — If both Z bosons in the signal are correctly reconstructed their 
invariant mass peaks around the Higgs boson mass, m 2 H = (pz,iep +Pz,had) 2 - The 
shape of the mu distribution is determined by the width of the Higgs boson and the 
ability of the algorithm to remove underlying event and initial state radiation from 
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the hadronic Z reconstruction. In practice, such an analysis would be combined with 
a likelihood fit or other elaborate statistical methods, taking into account systematic 
uncertainties. This is beyond the scope of this paper. Our choice of Higgs mass 
windows, Eq.(|2.4[), should give us a conservative estimate of the prospects of such an 
analysis. 

ZZ separation — After reconstructing the Higgs boson with a leptonic and a hadronic 
Z boson S/B can be further improved by requiring a maximum angular separation 
of ARzz < 3.2. For Z + jets the angular separation of the reconstructed leptonic Z 
and the fake-Z from QCD jets often becomes large, to accomodate the large invariant 
(Higgs) mass. A similar effect we could achieve by scaling the pi cut on the hardest 
jet to higher values for larger Higgs masses. 



pruning + trimming — We know that a combination of pruning M, and trim- 



ming f3lT| helps discriminating the decay products of a color singlet resonance from 
QCD jets All events passing the two Z tags and the Higgs reconstruction we 

re-process using pruning and trimming on the massless cells of the event [16]. For the 



pruning we use the C-A algorithm. For each pair of protojets to be combined we test 
if ARij > m fat j e t/pT,fat jet and min(p Ti j,p T ,j)/pr,i+j > 0.1. If both conditions hold 
true, the merging is vetoed and we discard the softer protojet. For the trimming we 



use the anti-fc;r algorithm [46] to define the fat jet and the inclusive kx algorithm [47] 
with a small cone R = 0.2 for the subjet recombination. During trimming we keep 
all subjets with PT subjct > 0.03 pr fat jct • Only if the pruned and trimmed masses of 
the leading jet are in the range mz ± 10 GeV we accept the event. 

In Table § we show the results for our LHC analysis for each of these steps. We 
separately give the gluon fusion and weak boson fusion signal rates and the background 
cross sections. After the reconstruction of the leptonic Z and requiring > 150 GeV for 
the leading jet the Z + jets background still exceeds the signal by roughly a factor 1000. 
The hadronic Z reconstruction in combination with the Higgs mass condition reduces this 
background tremendously and leaves us with typically S/B > 1/10. Especially for a heavy 
Higgs boson the ARzz cut proofs efficient against the Z+jets background. Finally, the 
combined pruning and trimming on the hadronic Z improves S/ B over the whole considered 
Higgs mass range. The significance quoted can even further improved by including electrons 
in leptonic Z reconstruction. 

It is interesting to track the relative contributions of the gluon fusion and the weak 
boson fusion contributions to the inclusive signal. For small Higgs masses at 14 TeV collider 
energy the acceptance cuts leave us with a 80%-20% balance of the two channels. This 
enhancement as compared to the total rates is due to the generically larger Higgs transverse 
momentum in weak boson fusion, even if we do not cut on the tagging jets. For intermediate 
masses the weak boson fusion contribution drops to a 90%-10% ratio, because the Higgs 
transverse momentum of pt,h = 0(mw) does not help to significantly boost the Z decay 
products. Both channels are pushed far into their pt,h tails by the acceptance cuts. For 
large Higgs masses we know that the relative rate of weak boson fusion as compared to 
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m H [GeV] 


300 


400 


500 


600 


a [fb] 


cs OB 


os ^b 


os ob 


os ob 


selection 

^had 

AR ZZ 
prun/trim 


3.37/0.89 907.3 
0.79/0.22 27.11 
0.46/0.17 1.02 
0.45/0.17 1.00 
0.29/0.12 0.39 


8.89/0.97 907.3 
3.81/0.42 27.11 
3.35/0.35 9.50 
2.99/0.35 7.93 
2.02/0.24 3.97 


4.91/0.70 907.3 
2.36/0.35 27.11 
1.98/0.28 10.53 
1.52/0.28 6.52 
1.11/0.18 3.33 


2.19/0.46 907.3 
1.11/0.25 27.11 
0.88/0.20 8.08 
0.60/0.15 3.82 
0.46/0.12 1.97 


S/B 

s/Vb 10 


1.03 
2.0 


0.57 
3.6 


0.39 
2.2 


0.30 
1.3 


selection 

^had 

mg c 

prun/trim 


17.97/3.83 6200 
3.80/1.00 180.0 
2.21/0.76 6.56 
2.18/0.76 6.45 
1.34/0.48 2.10 


46.18/4.64 6200 
18.03/2.03 180.0 
15.50/1.65 61.47 
13.94/1.55 52.22 
8.96/1.07 19.21 


29.48/3.87 6200 
13.49/1.98 180.0 
11.27/1.56 69.09 
8.98/1.35 45.14 
6.32/1.00 18.01 


15.08/2.90 6200 
7.24/1.62 180.0 
5.75/1.24 54.16 
4.19/0.98 27.89 
3.15/0.77 11.83 


S/B 

s/Vb 10 


0.87 
4.0 


0.52 
7.2 


0.41 
5.5 


0.33 
3.6 



Table 2: Signal and backgrounds for the semi-leptonic fat-jet analysis for a collider energy of 
7 TeV (upper) and 14 TeV (lower). The expected significance is calculated for 10 fb -1 . We show 
gluon fusion (left) and WBF (right) contributions separately for the signal cross sections. For the 
numbers of the expected significance we take both contributions into account. 



gluon fusion increases because of a logarithmic enhancement. This effect increases the 
relative weight of weak boson fusion back to 80%-20%. Of the different cuts only the 
hadronic Z reconstruction shows a bias towards weak boson fusion, because the fat jet 
reconstruction is expected to benefit from the lower jet activity in this channel [17]. The 



final contribution from weak boson fusion ranges from 15% for 400 GeV < mu < 500 GeV 
to 30% for either smaller or larger Higgs masses. This weak boson fusion contribution we 
expect to be a major handle for improving our results using advanced analysis methods. 
While here we do not make use of any of its kinematic features to suppress backgrounds 
an neural net could clearly include them. 

In Figure |3| we show the reconstructed Higgs masses after pruning and trimming. The 
signal excess over backgrounds is clearly visible for ran = 300 — 500 GeV. By asking for 
boosted Z bosons with a large angular separation we slightly shape the dominant Z + jets 
background and generate a maximum around 450 GeV which should be taken into account 
in a side-bin analysis. 

As also shown in Table || the cross sections for yfs = 7 TeV are too small to allow for 
a Higgs discovery with early data. However, in new physics scenarios with a modified ggH 
coupling this might change. A straightforward example is a chiral fourth generation for 
which electroweak precision data favors Higgs masses between 300 and 500 GeV |^]. Its 
loop contribution enhances the ggH coupling by roughly a factor three. For an early LHC 
run at 7 TeV collecting 1 fb -1 integrated luminosity our semi-hadronic analysis could then 
give 15 signal versus 4 background events for mu = 400 GeV. 
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Figure 3: Invariant mass distribution of the reconstructed Z bosons, razz, in the semi-leptonic 
channel at 7 TeV (left) and 14 TeV (right) center-of-mass energy. A cut for the maximum angular 
separation of the Z bosons has been applied, Ai?z Iop! .z had , as well as the combined usage of pruning 
and trimming. The signal consists of two parts. In every bin the top part shows the contribution 
of the WBF production process and the lower part the gluon fusion production process. 



4. Conclusions 



We have shown that fat jet techniques will allow us to extract semi-leptonic H — > ZZ 
decays at the LHC. To discriminate the signal from the large Z + jets background we 
use a combination of mass drop searches and filtering based on large light-flavor C-A 
jets, as previously proposed to reconstruct a hadronic Higgs decay [|14|]. For Higgs masses 
between 350 GeV and 500 GeV a successive reconstruction of the two Z bosons and the 
Higgs boson extracts the inclusive signal at the 5<r level based on 10 fb _1 at a 14 TeV 
LHC. Using additional information on the QCD structure of the event by employing a 
combined pruning/trimming analysis gives us typical signal-to-background ratios S/B ~ 
1/2. We suggest that a thorough analysis of jet substructure techniques using early data 
in comparison to Monte-Carlo predictions is performed in the near future. 

Comparing our results to the purely leptonic ZZ channel at 14 TeV collider energy 
the leptonic signal on the one hand achieves S/B > 1 while the semi-leptonic analysis 
only reaches S/B ~ 0.33 — 0.87. On the other hand, this is compensated by the larger 
number of signal events in the semihadronic channel. The semihadronic channel, which 
has never been considered to be a Higgs boson discovery channel before, can have as 
much statistical significance as the purely leptonic 'gold plated' mode. Heavy Higgs boson 
detection might greatly benefit from the orthogonal strength of our semi-leptonic ZZ — > 
iljj search, especially if for some reason the LHC should fall short of the full design energy 
or luminosity. 
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